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Abstract: 

It is shown here that the equivalence class of an expression under the congruence closure of any 
finite set of equations between ground terms is a context free expression language. An expression is either a 
symbol or an n-tuple of expressions; the difference between expressions and strings is that expressions have 
inherent phrase structure. The Downey, Sethi, and Tarjan algorithm for computing congruence closures can 
be used to convert a finite set of equations 2 to a context free expression grammar G such that for any 
expression u the equivalence class of u under 2 is precisely the language generated by an expression form 
F(u) under the grammar G. The fact that context free expression languages are closed under intersection is 
used to derive an algorithm for computing a grammar for die equivalence class of a given expression under 
any finite disjunction of finite sets of equations between ground expressions. This algorithm can also be used 
to derive a grammar representing the equivalence class of conditional expressions of the form 
if P then u else v. The description of an equivalence class by a context free expression grammar can also be 
used to simplify expressions under "well behaved" simplicity orders. Specifically if G is a context free 
expression grammar which generates an equivalence class of expressions then for any well behaved simplicity 
order there is a subset G’ of the productions of G such that the expressions generated by G’ are exactly those 
expressions of the equivalence class which are simplicity bounds and whose subterms are also simplicity 
bounds. Furthermore G’ can be computed from G in order nlog(n) time plus the the time required to do 
order nlog(n) comparisons between expressions where n is the size G. 
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1. INTRODUCTION 


There are several areas in which it is important to be able to "solve" for certain expressions. The 
most familiar example is in applied mathematics where one is interested in "solving” a system of simultaneous 
equations. A related problem is that of answering data base queries of the form "what is x?". In general one 
is presented with a collection of terms, some methods for showing equivalences between terms, and the 
problem of "solving for" or "simplifying" some particular term. 

One motivation for algorithms which solve for expressions arises from procedural attachment. 
Consider a data base consisting of a collection of first order sentences and suppose that the binary function 
symbol + is intended to denote ordinary addition over the integers. Clearly it is possible to directly compute 
the sum of two numerals and thus one can benefit from associating the function symbol + with such an 
addition procedure (such procedural attachment in FOL is described by Weyhrauch [1]). However to apply 
the addition procedure to a term such as +[flx], f[y]] one must solve for its subterms, in this case fix] and fly], 
in terms of numeric constants. Thus the usefulness of procedures attached to function or predicate symbols 
can depend upon the ability to solve for expressions. 

Another motivation for algorithms that solve for expressions arises from Moore’s notion of a rigid 
designator [2]. Moore addresses the philosophical question of when an individual (or a data base) "knows" 
the meaning of some term. For example when does an individual "know" John’s phone number. Moore 
/^""N addresses this question by first defining a modal logic with a Kripke style semantics and then defining the 
notion of a "rigid designator" as a term, such as die numeral 2, which has the same meaning in all possible 
worlds. Moore then defines "knowing the meaning of a term” such tiiat a person knows the meaning of the 
term phonc#[John] just in case there is a rigid designator d such that the person can deduce that 
phone# [John] equals d. The details of Moore’s constructions are not important here other than to note that 
he makes a distinction between terms which may denote different things in different worlds and rigid 
designators which can not. Now consider a data base consisting of sentences in Moore’s logic and a question 
of die form "what is phonc#]John]?". Intuitively one should expect the system to either answer "I don’t 
know", or to return a rigid designator such as the numeral 2537884. Thus the problem of answering a query 
of die form "what is x?" can be reduced to the problem of "solving" for x in terms of a rigid designator. 

In general suppose that some set of symbols has been identified as "independent" (or rigid in 
Moore’s sense) and all other symbols are considered "dependent". The problem addressed here is that of 
taking a finite set of equalities 2 between ground terms and a particular term u and "solving" for u in terms of 
the independent symbols by performing substitutions of equals for equals. As an example consider the three 
equations a = JMa]]], b - f[a] and c = l]b] and suppose that the symbols f and c are taken to be 
independent. It is possible to solve for b in terms of f and c, specifically b = ffflc]]. As another example 
consider the equations a = f]b c], b = g[a c], and c = f]g[a c] c] where g and c are taken to be independent. 
It follows from these equalities that b = g[c c]. 

The only technique for showing equivalences used by the procedures described here is the 
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substitution of a ground expression for an equivalent ground expression, 'lire problem of solving for an 
expression using substitution as the only means of showing equivalences may seem like a very special case of 
the problem of solving for an expression using arbitrary techniques for showing equivalences. However the 
substitution of equals for equals can play an important role in cases where other techniques for showing 
equivalences arc also sound. For example consider a set of simultaneous linear equations which can be 
"solved" using some standard matrix inversion procedure. Steele and Sussman [3] propose an alternative to 
the standard matrix inversion techniques which they call "constraint propagation". Constraint propagation 
can be more efficient than matrix inversion when the matrix of coefficients is sparse. Constraint propagation 
also seems to model the way people often solve sets of equations. In constraint propagation a set of equations 
is expanded to a larger set by solving each equation locally for each variable appearing in that equation. 
There is then a "propagation" phase based purely on the substitution of equals for equals. This 
"propagation" phase of the process could be handled by the algorithms presented here. A more complete 
discussion of the relationship between constraint propagation and substitution of equals for equals is given in 
[4]. 

The problem of solving for an expression given a set of equalities between ground terms can be 
usefully generalized in two ways. First instead of considering a set of equations one can consider an arbitrary 
Boolean formula built up from equalities between ground terms. For example die following Boolean formula 
implies the equality f(x) = I. 

^ (x = a V x = b) A f]a] = 1 A f[b] = 1 

A second way the problem can be generalized is to consider simplification under some simplicity 
order on terms. Solving for an expression is just a special case of simplification. To see this consider the 
partial order on terms defined by making a term u "simpler" than a term v just in case v contains dependent 
symbols while u does not. Under such a simplicity order solving for a given term is equivalent to 
"simplifying" that term to an expression containing only independent symbols. 

The importance of the more general problem of simplifying terms under a predefined simplicity 
order can be seen by considering a set of simultaneous equations in applied madrcmatics. Numerals and 
terms constructed purely from numerals and "known" function symbols such as + are in some sense simpler 
than other terms. Often symbolic "constants" arc used in applied mathematics and terms constructed purely 
from numeric and symbolic constants arc in some sense simpler than terms which contain "variables". 
Variables are sometimes divided into dependent and independent variables and terms which contain no 
dependent variables are in some sense simpler than terms which do not. Also the size (or some other measure 
of complexity) can be an important factor in'determining a terms simplicity. 

The above problems are approached here by establishing a relationship between equivalence classes 
and context free expression grammars. A context free expression grammar is just like a context free string 
grammar except that it describes a set of expressions (terms) rather than a set of strings (it is important to note 

f "A 

that an expression is a tree and therefore has an inherent phrase structure while a string has no such 
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structure). A finite set 2 of equalities between ground expressions induces a congruence relation on 
expressions where two expressions are congruent just in ease they can be proven equal from the equalities in 
2. Such a congruence relation will be called a finitely equational congruence relation. A finitely equational 
congruence relation can be represented by a context free expression grammar. Given a finite set 2 of 
equalities between expressions the congruence closure algorithm of Downey, Sethi, and Tarjan [5] can be used 
to construct a context free expression grammar G such that for any expression u the equivalence class of u is 
the language generated by an expression form F(u) under G. Furthermore G can be constaicted from 2 in 
order «log(«) average time where n is the size of 2. 

Let u be an arbitrary expression and let Q be an arbitrary consistent Boolean expression built up 
from equalities and the standard Boolean connectives A, and V. Using the fact that context free 
expression languages are closed under intersection it is possible to construct a context free expression 
grammar G which generates the set of expressions which can proven equivalent to u given the formula Q. 
This procedure can be used to compute a grammar for the class of simple expressions which are equivalent to 
a conditional expression of the form if Q then u else v. 

Finally a procedure is developed for simplifying expressions under an arbitrary well behaved 
simplicity order. Let G be a grammar such that the equivalence class of an expression u is the language 
generated by an expression form T(u) under G. For any well behaved simplicity order there is a subset G’ of 
the productions of G such that the language generated by r(u) under G’ is precisely the set of simplifications 
of u. The subset G’ can be computed from G in order nlog(n) time plus the time required to do order nlog(n) 
comparisons between expressions where n is the size of G. 

It is hoped that the ability to represent finitely equational congruence relations with context free 
expressions grammars will provide both a deeper understanding of such congruence relations and a more 
flexible computational framework in which to perform deduction. 
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2. EXPRESSIONS, EQUATIONS, AND GRAMMARS 

The algorithms described here work just as well on second order terms as on first order terms. Thus 
expressions which are intended to denote functions or predicates arc not handled any differently from 
expressions denoting domain elements. In fact the algorithms described here do not depend on any "typing" 
at all. This observation motivates the following definition of the set of all expressions over an alphabet A. 

Definition: An expression over an alphabet A is either a symbol in A or an n-tuple 

<u^ U 2 ... u n > of expressions over A. 

This definition of an expression is similar to the definition of an s-expression in LISP. While all of 
the algorithms described here operate on expressions they work just as well when restricted to typed 
expressions or first order terms. 

Consider a set of equalities 2 between expressions. If 2 was a set of equalities between first order 
terms then it would be clear what equalities follow from 2. However since expressions are in some sense 
more general than first order terms an explicit definition of tine set of equalities which are deducible from 2 is 
given below (it is based on the standard deductive properties of equality). There is a simple semantics (which 
will not be presented here) for expressions under which the following notion of "deducible" is both sound 
and complete. 

Definition: The set of equalities deducible from a set of equalities 2, is the smallest set which 

contains 2 and which satisfies is the following deductive principles: 

Reflexivity: For any expression u, u = u is deducible. 

Symmetry. If u = v is deducible from 2 then v = u is deducible from 2. 

Transitivity : If both v = w and u = w are deducible from 2 then u = v is 
deducible from 2. 

Subslitulivity : If the equalities u^ = v^, U 2 = V 2 >... u n = v n are all deducible 
from 2 then the equality <u^ U 2 ... u n > = <V 2 v ^ — v n > is deducible from 2. 

The following definitions and lemma 2.1 provide some basic concepts relating sets of equalities to 
equivalence relations on expressions. While these notions may seem obvious and redundant they facilitate 
precision in later sections. 

Definitions'. Let ~ be an equivalence relation on expressions. The relation ~ will be said 

to subsume an equality u = v just in case u is equivalent to v under i.e. just in case u~v. 

The relation ~ will be said to subsume a set of equalities 2 just in case it subsumes every 
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equality in 2. The relation ~ is called a congruence relation just in case it is substitutive, i.c. 
whenever the equivalences Uj~\ ; j, u 2 ~v 2 , ... u n ~v n hold, the equivalence 
<tij u 2 ... u n >«<v 1 v 2 ... v fi > also holds. 

lemma 2.1 : Congruence relations are deductively closed, i.e. if ~ is a congruence relation 
and 2 is a set of equalities subsumed by ~ then any equality dcducible from 2 is also 
subsumed by ~. 

The following definition provides another characterization of the set of equalities dcducible from a given set 

2 . 


Definition : The congruence closure of a set of equalities 2 is the equivalence relation 
on expressions which subsumes exactly those equalities which are dcducible from 2, i.e. 
u~ 2 y just in case the equality u = v is deducible from 2. The equivalence class of an 
expression u under the congruence closure of 2 is denoted [ul^. Tire congruence closure of 
a finite set 2 will be called a finitely equational congruence relation. 

It is interesting to note that (uj^ can be an infinite set even when 2 is finite. For example if 2 is 
{<f a> = a) then lal^ includes all expressions of the form <f<f ...<f a>...». The main result of this section 
will be that jul^ can be described by a context free expression grammar as defined below. 

Definition: Let A be an alphabet of terminal symbols and let A be a collection of 

non-terminal symbols such that A is disjoint from A. A context free expression grammar 
over A and A is a set of productions of the form X=>a where X is a non-terminal symbol 
and a is an expression over A union A. 

Expression grammars are very much like string grammars. Each non-terminal symbol of an 
expression grammar generates a set of expressions over the terminal alphabet in the same way that each 
non-terminal of a context free string grammar generates a set of strings. For example the grammar consisting 
of tire productions A=>a and A=><f A> describes the set of expressions of the form <f <f..,<f a>...». 

Let A be a set of terminal symbols and A be a set of non-terminal symbols. Expressions over A 
(expressions containing only terminal symbols) will be called terminal expressions (or simply expressions) and 
will be denoted by the letters u, v, and w. Expressions over A union A will be called expression forms and will 
be denoted by the greck letters a, /?, and y. -For a given grammar G the relation =^* is defined on expression 
forms as the smallest relation satisfying the following three properties: 
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1) For any expression form a, a=>*a. 

2) IfX=>a is a production ofG then X=>*a. 

3) If a=>*(} then for any expression form <y^... a ...y n > containing a, 

<Yl ... « ,.-Y n >=>*<Yi... •y n > 

Intuitively a=>*(i just in case /? can be derived from a by replacing some number of non-terminal 
symbols in a by expressions which they generate. The language generated by an expression form a is defined 
to be the set of terminal expressions u such that a=>*u. 

The investigation of context free expression grammars is motivated by the existence of a relationship 
between a certain class of such grammars and finitely equational congruence relations on expressions. The 
following definition identifies the relevant class of grammars. 

Definition: A context free expression grammar will be called normal if the following two 
conditions hold. First all productions are either of the form X=>a where a is a terminal 
symbol, or of the form X=><Y} Y 2 ... Y n > where each Yj is a non-terminal symbol. Second 
there are no two productions X=>a and Y=>/3 such that X and Y are distinct non-terminals 
but such that a and /? are the same expression form. 

It turns out that any normal grammar G represents a congruence relation on expressions. In general 
all expressions generated by a given expression form a will be equivalent under this relation. However not all 
expression forms generate entire equivalence classes. In particular if a=>*/3 then the language generated by 
p may be only a proper subset of the language generated by a. It turns out that the expression forms which 
generate entire equivalence classes are precisely die maximal expression forms where maximal is defined as 
below: 


Definition: An expression form a will be called maximal under an expression grammar G 
just in case there is no expression form /? other than a such that /?=> *a under G. 

Consider the equivalence class of the expression <g a> under the congruence closure of the equation 
<fa> = a. The following normal grammar "represents" the congruence relation on expressions imposed by 
this equality (the first "production" below represents in the standard way the pair of productions A=>a and 
A=><F A>). 


A => a | <F A> 

F => f 

The equivalence class of <g a> under the equality <f a> = a is exaedy the language generated by the 
expression form <g A> under the above grammar. Note that die expression forms A and <g A> are maximal 
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under this grammar while the expression form <f A> is not. 

For any function T on expressions let denote the equivalence relation such that u~pv just in 
case F(u) equals F(v). Furthermore for any equivalence relation ~ on expressions and any expression v let 
|v|~ denote tire equivalence class of v under 22. The following theorem establishes tire first half of the 
relationship between normal expression grammars and finitely equational congruence relations. 


Lemma 2.2: Let G be any normal context free expression grammar. For each expression u 
there exists a unique expression form T(u) which is maximal under G and which generates u. 
Thus there exists a unique function T, called the maximal generator function of G, which 
maps each expression u to the maximal expression T(u) which generates u. Furthermore 
~p is a finitely equational congruence relation and for any expression u the language 
generated by T(u) is precisely |u|~ . 


proof: The function T is defined recursively via the following conditions: 


1) For any terminal symbol a if there is a production of the form X=*a then T(a) is X otherwise 
T(a) is a. 

2) If u is of the form Cu-^ u 2 ... u n > then if there is a production of the form 
X=><r(u 1 ) T(u 2 ) ... T(u n )> then T(u) equals X otherwise T(u) equals <T(u 1 ) T(u 2 )... T(u n )>. 

Since the normality of a grammar ensures that no two distinct productions have the same right 
hand side, the above definition is well formed. Note that T(u) can be computed in linear time in 
tlie size of u (in general membership in a context free expression language can be determined in 
linear time although this will not be proven here). 

A simple induction on expressions can be used to show that for any expression u, T(u)=>*u 
(the details are left to the reader). It will now be shown by induction that for any expression u, T(u) 
is maximal. To see this first note that in a normal grammar each non-terminal symbol is a maximal 
expression form (the only way a non-terminal symbol X could fail to be maximal is if there was 
some other non-tenninal symbol Y such that Y=t>X was a production of G, but this is not allowed 
in normal grammars). Now if a is a terminal symbol then either T(a) is a non-tenninal symbol 
(which is a maximal expression form) or T(a) is a. Rut if F(a) is a then by the definition of T there 
can be no production of G whose right hand side is a in which case a is a maximal expression fonn. 
Now let u be of the fonn <Uj u 2 ... u n > and assume that for each u j( T(Uj) is maximal. If T(u) is a 
non-terminal symbol then it is maximal, on the other hand if T(u) is not a non-terminal symbol 
then it is an n-tuple of maximal expression fonns which is not tire right hand side of any production 
and is therefore maximal. 

The proof that for any expression v there is at most one maximal expression form which 
generates v is done by induction on v. For a terminal symbol a if there is no production of the form 
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X=>a in ilic grammar then a is the unique maximal expression form which generates a. On the 
other hand if there is a production of the form X=^a then since the grammar is normal there is only 
one such production and the non-terminal symbol X is the unique maximal expression form which 
generates a. Now let v be an expression of the form <vj v-,... v n > such that for each vj there is at 
most one maximal expression form which generates Vj. Since r(Vj) is maximal and generates Vj it 
the unique such maximal expression form. Now there are two eases. The first case occurs when 
there is a non-terminal symbol X such that X=s>-*v. Since the grammar is normal it must contain a 
production of the form X=XX^X 2 X n > where each Xj is a non-terminal symbol such that 
Xi=>*Vi_ However since each non-terminal symbol is maximal, Xj must equal T(vj) and therefore 
the production X=XT(Vj) T(v 2 )... T(v n )> is in the grammar. This must hold for every 
non-terminal symbol X which generates v, i.e. for each X which generates v the above production 
must be in the grammar. However since there can be at most one production whose right hand side 
is <X(vj) r(v 2 )... r(v n )> there can be at most one non-terminal symbol which generates v. Note 
that in this case the expression form <T(Vj) r(v 2 )... r(v n )> is not maximal. The second case occurs 
when there exists a maximal expression form a which generates v but which is not a non-terminal 
symbol, i.e. there exists an expression form a which generates v and is of the form <o^ a 2 ... a n >. 
Since a is maximal each a { must also be maximal and since a generates v, oq must generate vj for 
each i. However since there is at most one maximal expression generating Vj, aj must be r(v-) so a 
must be <T(vj) T(v 2 )... r(v n )>. Note that in this case <T(vj) T(v 2 )... T(v n )> is maximal and 
therefore the first and second case can never occur at the same time. So either there is a unique 
non-terminal symbol which generates v or there is no such non-terminal and <r(vp T(v 2 )... T(v n )> 
is the unique maximal expression which generates v. 

The relation is substitutive, i.e. is a congruence relation. To see this let u be an expression 
of the form <u^ u 2 ... u n > and let v be any expression of the form <v^ v 2 ... v n > . It suffices to show 
that if r(uj) equals f(Vj) for each i, then T(<u^ u 2 ... u n >) equals T(<v^ v 2 ... v n >). However this 
follows directly from the definition of T. 

To establish that the language generated by T(u) is precisely |u|~^ first note that if v is in 
lul^ , i.e. if T(v) equals T(u), then since r(v)=>*v it must be the case that r(u)=>*v. On the 
other hand if r(u)=>*v then since there is at most one maximal expression form which generates v, 
f(u) equals T(v) and therefore v is in 

Finally it must be shown that the congruence relation defined by T is finitely cquational, i.e. 
that there is a finite set of equalities 2 such that ~p is the same as A non-terminal symbol X 
will be called coherent if the language'generated by X is not empty. A production or expression 
form will be called coherent if it contains only coherent non-terminal symbols.' Each coherent 
non-terminal symbol X can be associated with some expression u(X) such that X=»*u(X). Now for 
each coherent expression form a, u(a) is defined as the result of replacing each non-terminal 
symbol X by u(X); clearly a=>*u(a). Now let 2 be the set of equalities of the form u(X) = u(a) 
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where X=>a is a coherent production of G. Now if u(X) = u(a) is an equality in 2 then X 
/"■N generates both u(X) and u(«) under,G; Thus if u(X) = u(a) is an equality in 2 then F(u(X)) equals 

r(u(a)) and therefore u(X)~pu(a). Thus every equality in 2 is subsumed by ~p and since 
is a congruence relation every equality dcducible from 2 is therefore also subsumed by ~p. It 
remains only to show that every equality subsumed by ~p is dcducible from 2. This is done by 
showing via a standard induction on expressions that for any expression v the equality v = u(T(v)) is 
dcducible from 2 (the details are left to the reader). Thus if the equality w = v is subsumed by ~p 
then since T(w) equals T(v) and the equalities v = u(F(v)) and w = u(T(w)) are both dcducible from 
2, the equality w = v is dcducible from 2. 

The following lemma completes the relationship between normal grammars and finitely equational 
congruence relations by showing that every finitely equational congruence relation can be represented by a 
normal grammar. 
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Lemma 2.3: For any finite set of equalities 2 there is a normal context free expression 
grammar G with maximal generator function T such that ~p is the same as 
Furthermore G can be computed from 2 in order nlog(«) average time where n is the total 
size of 2. 

proof: Let D be the finite set of expressions contained in 2 either directly or as a subexpression of 
some expression contained in 2. Downey, Tarjan, and Sethi [5] show how to construct a function F 
on D such that for any two expressions u and v in D, F(u) equals F(v) just in case the equality u = v 
follows from 2. Furthermore a tabular representation of the function F can be computed in 
«log(«) average time where n is the total size of 2. Such a function F can be translated into a 
normal context free expression grammar G in the following way: Each expression v in D is 
associated with a non-terminal symbol X(v) such that for any two expression u and v in D, X(u) 
equals X(v) just in case F(u) equals F(v). Now for each terminal symbol a in D we include the 
production X(a)=s>a and for each expression v in D of the form <Vj vj ■■■ v n > we include the the 
production X(v)=><X(vp) X^)... X(v n )>. The grammar G can be computed from F in time 
proportional to the total size of 2. 

To show that G is normal it is sufficient to show that no two distinct productions have the same 
right hand side. Let u and v be two expressions in D such that the production corresponding to u 
has the same right hand side as the production corresponding to v. It is sufficient to prove that X(u) 
must equal X(v) since then the production corresponding u would be the same as the production 
corresponding to v. Since the right hand sides of tire productions corresponding to u and v are the 
same either u and v arc both the same terminal symbol (in which case the result is trivial) or u is the 
form <up U 2 ... u n >, v was of the form <vj v ^ ... v n >, and X(up equals X(vj) for each i. However in 
the latter case each equality Uj = Vj would be dcducible from 2 and therefore by substitutivity the 
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equality u = v would also be deduciblc from 2 and thus Xfu) would equal X(v). 

It remains to show that ~p is the same as where T is the maximal generator function of 
G. First note that if u = v is an equality in 2 then u and v are in D and X(u) must equal X(v). 
Furthermore it is easily shown by induction that for any expression u, if u is in D then r(u) equals 
X(u). Thus if u = v is an equality in 2 then T(u) equals F(v) and thus ~p subsumes 2. 
Furthermore lemma 2.2 ensures that ~p is a congruence relation and therefore subsumes any 
equality deduciblc from 2. It remains only to show that any equality subsumed by ~p is 
deduciblc from 2. To show this first note that for each non-terminal symbol Y in G there is at least 
one expression w in D such that Y equals X(w). Thus each non-terminal symbol Y can be 
associated with an expression u(Y) in D such that X(u(Y)) equals Y. Now for any expression form 
a let u(a) be the result of replacing each non-terminal symbol Y in a by u(Y). It can now be shown 
by a standard induction that for any expression w die equality w = u(T(w)) is deducible from 2 (the 
details are left to the reader). Finally if an equality w = v is subsumed by ~p then since T(w) 
equals T(v), and the two equalities w = u(T(w)) and v = u(T(v)) are both deducible from 2 the 
equality w = v can also be deduced from 2. 


A set of expressions L will be called a context free expression language if there is a context free 
expression grammar G and a non-terminal symbol X of G such drat L is the language generated X under G. 
Now let G be any normal grammar and let T be the maximal generator function of G. Note that for any 
expression u, if f(u) is a non-terminal symbol of G then |u|~ is a context free expression language. 


Furthermore if T(u) is not a non-terminal symbol then one can simply construct a new grammar G’ by adding 
a new non-terminal symbol X and the production X=>T(u) so that |u|~^ is the language generated by X 
under G’. In either case is a context free expression language. These observations and the above 

lemmas now yield the main result of this section: 


Theorem 2.4 For any finite set of equations 2 and any expression u, |u|^ is a context free 
expression language. 


At this point it is useful to introduce another simple example. Consider the following three 
equations: 


<f<f<fa»> = a 
<fa> = b 
<fb> = c 


These equations correspond to the following normal grammar: 
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A => a | <F C> 

B => b | <F A> 

C => c | <F B> 

F => f 

Under this grammar T(<g <f b») equals <g C> and the equivalence class of <g <f b» equals the set 
of expressions generated by <g C>. What expressions are there, if any, which are both equivalent to <g <f b» 
and contain only the symbols g, f and a? The answer is the set of expressions generated by the expression 
form <g C> under the following subset of productions: 

A => a | <F C> 

B => <F A> 

C=^<FB> 

F => f 

In general consider a division of the terminal symbols into "dependent" and "independent" 
symbols. The following theorem provides a technique for solving for expressions in terms of independent 
symbols. 

Theorem 2.5: Let G a normal grammar with maximal generator function T, and let G’ be 

(F* .^ 

that subset of the productions of G which do not contain any dependent symbols. For any 
expression u the language generated by F(u) under G’ is precisely the set of expressions 
which are equivalent to u under and which do not contain any dependent symbols. 

Proof: Any expression generated by F(u) under G’ contains no dependent symbols and is 
equivalent to u under i.e. is a solution for u. On die other hand any solution v for u must be 
equivalent to u and therefore must be generated by T(u) under G. But in this case since v does not 
contain any dependent symbols the derivation of v from T(u) must not involve any productions 
which contain dependent symbols. Thus v must be generated by T(u) under G’ as well. 


/“S 
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3. BOOLEAN CONGRUENCE RELATIONS 


Now that a good representation has been deveioped for equivalence classes under finitely equational 
congruence relations we turn our attention to Boolean congruence relations. An equational Boolean formula 
over an alphabet A is a formula which is built up from equalities between expressions over A using negations, 
disjunctions, conjunctions, and implications in the standard way. The congruence closure of an equational 
Boolean formula Q is defined to be the congruence relation ~q on expressions such that u~qv just in case 
u = v is dcduciblc from Q using the standard deduction rules for Boolean connectives and the deduction rules 
for equalities described earlier. For an arbitrary expression u the equivalence class of u under the congruence 
closure of Q will be denoted by |u|q. It will be shown in this section that |u|q is either tire universal rclauon 
or a context free expression language. 

Let P be an equational Boolean formula which is a conjunction uj = v^ A A ug = ... of 

equalities and negations of equalities and let 2 be the set of equalities which appear in positive form in P. 
Using the techniques described in the previous section one can compute a normal expression grammar G with 
maximal generator function T such that ~p is the same as Now the conjunction P is satisfiable just in 
case diere is no negated equality Uj^vj appearing in P such that the equality Uj = Vj is deducible from 2, or 
equivalently such that T(uj) equals r(vj). If the conjunction is unsatisfiable then anything follows from P and 
thus the congruence closure of P is- the universal relation, If the conjunction P is satisfiable then the 
congruence closure of P is just the congruence closure of 2 and for any expression u, |u|p equals |u|^ which 
equals the language generated by T(u) under G. 

Any equational Boolean formula can be converted to disjunctive normal form, i.e. is equivalent to a 
disjunction of conjunctions of equalities and negations of equalities. Let Q any equational Boolean formula 
and let Pj V P 2 V...V P n be a disjunctive normal form of Q where each Pj is a conjunction of equalities and 
negations of equalities. An equality u = v follows from Q just in case it follows from each Pj. In other words 
~q is the intersection of the ~p.’s. But by the above remarks the congruence closure of a given Pj is either 
the universal relation or the congruence closure of the positive equalities in that conjunction. These remarks 
lead to the following lemma: 

Lemma 3.1: The congruence closure of any equational Boolean formula is either the 

universal relation or a finite intersection of finitely equational congaience relations. 

Given the above observations it is clear that one can decide whether an equality u = v follows from a 
formula Q by first converting Q to disjunctive normal form and then seeing if u-v follows from each 
disjunct. However it is not clear that a mechanism which can solve for u under a set of equalities can be 
extended to a mechanism for solving for u under an arbitrary equational Boolean formula. What is needed is 
a good characterization of the equivalence class of u under Q. As a simple example consider the following 
formula in which <f n a> is an abbreviation for an expression of the form <f <f...<f a>...» where the symbol f 
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appears n times. 

/'■"N 


<f® a> =a V <f^ a> = a 


The equivalence class of a under this formula is the set of expressions of the form <f n a> where n is a 
multiple of 35. One approach to the problem of solving for expressions under a Boolean formula Q would be 
to try to find a finite set of equalities 2 such that ~q was the same as The following theorem 
demonstrates that this can not be done in general. 

Theorem 3.2: The intersection of two finitely equational congruence relations need not be 
finitely equational. 


Proof sketch: Consider the following pair of sets of equations: 


2 i: {a = b} 

^ 2 : {<f a> = a, <f b> = b, <g a> = <g b>} 

The intersection of anc * * S Same 3S '“"'Q w * iere Q ^ Boolean formula: 

a=b V (<f a> = a A <f b>=b A <g a>=<gb>) 

The relation ~q is the same as the relation where n is the infinite set of equalities of the 
form: 


<g <f n a» = <g <f n b» 


Now ~j-[ is finitely equational just in case there is a finite set of equalities 2 such that is 
the same as ^such a set 2 exists then if u = v is an equality in 2 then u~j-jv and thus the 
equality u = v must be deducible from IT. But an equality is dcducible from IT just in case it is 
deducible from some finite subset of IT and thus all of the equalities in 2 must be deducible from a 
single finite subset of II. Thus ~jq is finitely equational just in case it is the congruence closure of 
some finite subset of II. However given the above definition of II no finite subset of II can imply 
all of the equalities in II so ~j~j is not finitely equational. 


The above theorem says that there is a Boolean formula Q such that ~q is not finitely equational. 
However it is still possible that for each expression u there is a finite set of equalities 2 such that |u|q equals 
Mji (remember that there are infinitely many such expressions u). 

Let Q be an arbitrary equational Boolean formula. By Lemma 3.1 it is possible either to determine 
that the congruence closure of Q is the universal relation or to find a finite collection {2^ 2j ... 2 n } of finite 



3. BOOLEAN CONGRUENCE RELATIONS 


-14- 


May 1983 


sets of equations such that the congruence closure of Q is the intersection of the congruence closures of the 
Ej’s. Thus for any expression u we have the following relation: 

Mq = M^niui^n-lul^ 

Furthermore for each there is a normal grammar Gj with an associated maximal generator 
function Tj such that lul^. is the language generated by T^u) under Gj. This implies that |u|q is the 
intersection of a finite collection of context free expression languages. While it is well known that the 
intersection of two context free string languages need not be a context free language there are fundamental 
differences between string languages and expression languages. Specifically expressions have a-priori phrase 
structure while strings do not. This difference is responsible for the fact that membership in a context free 
expression language can be determined in linear time (as was mentioned in the previous section) and it is also 
responsible for the following theorem: 

Theorem 3.3: The intersection of two context free expression languages is a context free 
expression language. 

Proof. An expression grammar will be called one level if every production is either of the form 
X=>a where a is a terminal symbol or of the form X => <Xj X 2 ... X n > where each Xj is a 
non-terminal symbol. (A normal grammar is a one level grammar in which no two productions 
have the same right hand side.) It is easy to show for any context free expression language L there 
is a one level grammar G such that L is the language generated by a non-terminal symbol of G. 
Given these remarks Theorem 3.3 follows directly from the following Lemma. 

Lemma 3.4: For any two one level expression grammars G^ and G 2 there exists a grammar 
denoted as G 2 HG 2 such that for each non-terminal X of G^ and non-tenninal Y of G 2 
there exists a non-terminal of G 2 DG 2 denoted by Xfl Y such that the language generated by 
XDY under GjHG 2 is the intersection of the languages generated by X and Y under G^ 
and G 2 respectively. 

Proof: Given G| and G 2 define GjHG 2 to be the one level grammar meeting the following 
conditions: 

1) The set of non-terminal symbols of G j(TG 2 is the cross product of tire non-terminal symbols of 
Gj and the non-terminal symbols of G 2 . In other words for each non-terminal symbol X of Gj 
and non-tenninal Y of G 2 there exists a non-terminal symbol of G^f^ which'will be denoted 
here by XDY. 

2) For each terminal symbol a a production XflY => a is in GjHG 2 just in case the production 
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X => a is in G j and the production Y => a is in G 2 . 

3) A production of the form XflY => <XjHYj X 2 HY 2 ... X n nY n > is in G^C^ just in ease die 
production X => <Xj X 2 ... X n > is in Gj and the production Y => <Y^ Y 2 ... Y n > is in G 2 . 

Let u be any expression over terminal symbols. It will first be shown by induction on u that if 
XnY=s>*u then X=>*u and Y=s>*u. If u is a terminal symbol then this result follows directly from 
tire definition of G]HG 2 . If u is of tire form <Uj U 2 ... u n > then if XHY=i-*u then there must be a 
production of G|HG 2 of the form XflY => <X|flYj X 2 HY 2 ... X n nY n > such that XjnYj=>*Uj 
for each i. By the definition of G}nG 2 this implies that the production X => <X^ X 2 ... X n > is in 
G^ and the production Y => <Y| Y 2 ... Y n > is in G 2 . Furthermore by the induction hypothesis it 
must be the case diat X-=>*Uj and Yj=>*Uj for each i. Therefore it must be the case that X=>*u 
and Y=> *u and the induction is complete. 

Now it will be shown by induction on expressions that ifX=>*u and Y=>*u then XDY=>*u. 
Again if u is a terminal symbol then the result follows directly from tire definition of GiflG 2 . If u 
is of the form <u^ U 2 ... u n > then if X=>*u and Y=s>*u there must be a production in G| of the 
form X => <X^ X 2 ... X n > and a production in G 2 of the form Y => <Y^ Y 2 ... Y fl > such that 
Xj=>*Uj and Yj=s>*Uj for each i. But by the definition of G|flG 2 this implies that the production 
XHY => <X 1 DY 1 X 2 flY 2 ... X n fTY n > is in G^f^. Furtliermorc by the induction hypothesis 
/""’N XjfTYj=i > *Uj for each i. Therefore XDY=>*u and the induction argument is complete. 

Finally since XHY=>*u just in case X=>*u and Y=>*u the language generated by XHY 
under G^f^ is exactly the intersection of the language generated by X under G^ with the 
language generated by Y under G 2 . 

Theorem 2.4, Lemma 3.1, and Theorem 3.3 now immediately imply the following corollary: 

Corollary 3.5: For any consistent equational Boolean formula Q and any expression u, |u|q is 
a context free expression language. 

For any two grammars G^ and G 2 the size of die grammar G^F^ is proportional to the product of 
the sizes of the grammars G^ and G 2 . However it seems reasonable to expect that for most pairs X, Y of 
non-terminal symbols of Gj and G 2 respectively the language generated by X and the language generated by 
Y will be disjoint and therefore the language generated by XHY will be empty. A non-terminal symbol of a 
grammar will be called coherent if tire.language generated by that symbol is not empty. The coherent 
fragment of a grammar G is defined to be the grammar resulting from removing all productions which contain 
non-coherent non-terminal symbols. The coherent fragment of a grammar can be computed in time 
proportional to the size of that coherent fragment. 

The fragment of GjHG 2 which is relevant to describing the language generated by XflY can be 
restricted even further. A non-terminal symbol Z will be said to be accessible from the non-terminal W if 
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there is a production whose left hand side is W and whose right hand side contains Z, or if there is a 
non-terminal V such that V is accessible from W and Z is accessible from V. For a given grammar G the 
fragment of G accessible from W is the subset of productions of G which contain only symbols accessible 
from W. The fragment of G accessible from W can be computed in time proportional to the size of that 
fragment. If one is interested in the language generated by XDY under GiC^ then one need only consider 
the fragment of G^nG 2 which is accessible from XHY. 

Of course the intersection of a large set of context free expression languages can be done by 
iteratively applying the intersection algorithm implicit in the proof of Lemma 3.4. In the worst case the size of 
the grammar can grow exponentially in die number of languages which are intersected. Flowever it is 
expected that such exponential growth will not usually arise since at each step one can restrict the grammar to 
the coherent subset which is accessible from the non-terminal of interest. 

The mechanisms that have been described so far can be extended to deal with conditional 
expressions. For a given alphabet A of terminal symbols one can define a conditional expression to be either a 
simple expression (a non-conditional expression) or an n-tuple <if P u ^ U 2 > where P is an cquational Boolean 
formula and Uj and U 2 are conditional expressions. As was mentioned earlier it is straightforward to give a 
semantics for expressions and Boolean formulas. Such a semantics can be extended to a semantics of 
conditional expressions by defining the denotation of <if P u^ U 2 > to be the denotation of Uj if P is true and to 
be the denotation of U 2 if P is false. Now for a conditional expression w and a Boolean formula Q, |w|q is 
defined to be the set of non-conditional expressions which equal w in all interpretations which make Q true. 
This definition yields the following relation: 

|<ifPu 1 u 2 >|q = l^ilpAQ n l u 2f(—<P)AQ 

Given the results of this section the above relation provides a way of computing a grammar 
representing |w|q for any conditional expression w and consistent Boolean formula Q. Note that since 
elements of jw|g must be non-conditional expressions |w|q may be empty. 
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4. SIMPLIFICATION 

A simplicity order on expressions is a partial order such that u is simpler than v just in ease u is less 
than v under that order. For the purposes of this section let ~ be an arbitrary congruence relation on 
expressions and for any expression u let lul^ denote the equivalence class of u under Note that since the 
simplicity order need not be total or well founded there need not be a unique simplest expression in |u|~. 
The following definitions are important in discussing simplification under congruence relations. 

Definitions: A simplicity bound under ~ is an expression w such that there is no expression 
in Iwl^ which is simpler than w. An expression w will be called simplified under ~ just in 
case it and all of its subexpressions are simplicity bounds under ~. A simplification of an 
expression u under ~ is an expression in |u|~ which is simplified under Si. 

The main result of this section is that for any "well behaved" simplicity order and any consistent 
equational Boolean formula Q the set of simplifications of an expression u under ~q is a context free 
expression language. However it is important to note that there are "pathological" simplicity orders for which 
this result does not hold. In particular let 2 denote the single equation <f a> = a. Clearly |a|-£ is the set of all 
expressions of the form <{** a>. Now suppose that the simplicity order was such that larger expressions were 
always simpler than smaller expressions. Since the set |a|^ contains arbitrarily large expressions it would not 
contain a simplicity bound. To eliminate this problem one might required that the simplicity order be well 
founded, i.e. drat there does not exist any sequence of ever simpler expressions. However even under this 
restriction there are pathological orderings. Suppose for example that all expressions other than a are equally 
simple and but simpler than a. In this case any expression of die form <f” a> (for n greater than 0) is a 
simplicity bound for |a|^. However all expressions in |a|^ contain a which is not a simplicity bound and 
therefore no expression in |a|^ is a simplification of a. The following definition establishes a class of 
non-pathological or well behaved simplicity orders. Note that well behaved orders need not be well founded. 

Definition: A well behaved simplicity order on expressions is defined here as a partial order 
on expressions satisfying the following conditions: 

1) The order is pseudo-total, which means that there is a function L from expressions to a 
totally ordered set such that u is simpler dian v just in case L(u)<L(v). Such a function L will 
be called a totalizer for die simplicity order and L(u) will be called the simplification level of • 
u. Note that not all partial orders arc pseudo-total, in particular consider an order and three 
objects x, y, and z such diat x is less than y but z is unordered with respect to both x and y. If 
such an order were pseudo-total then since z and x arc unordcred L(z) would have to equal 
L(x). Similarly L(z) would equal I.(y). But then I,(x) would have to equal L(y) which 
conflicts with the ordering between x and y. 
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2) The order satisfies the following monotonicity condition: A context is defined to be an 

/"N expression with an "open slot" in it. For any expression w and any context C let C[w] be the 

result of replacing the open slot of C with w. For example if C is the context <f <g a> - b> 
then C[<g b>] is the expression <f <g a> <g b> b>. The monotonicity condition is that for any 
two expression u and v and any context C, if L(u)<L(v) then L(C[u])<L(C[v]) where L is 
any totalizer for the simplicity order. 

3) No expression is simpler than one of its subexpressions. 

Conditions on simplicity orders, such as the monotonicity condition above, can often be most readily 
expressed in terms of a totalizing function L. Such conditions can always be thought of as direct conditions 
on the simplicity order and therefore the truth of such conditions is independent of the choice of the totalizing 
function. In the remainder of this section L will always denote a totalizer for the simplicity order. 

The monotonicity condition on well behaved simplicity orders warrants some investigation. First it 
is easily shown that the monotonicity condition implies that if L(u) equals L(v) then for any context C, 
L(C[u]) equals L(C[v]) (if L(u) equals L(v) then L(C[u])<L(C[v]) and L(C[v])<L(C[u])). Thus the level of an 
expression is determined by the levels of its subexpressions. This implies that any well behaved simplicity 
order can be characterized by a triple <D, Lj, Lf> where D is a totally ordered set of "simplicity levels", L] is 
a function from symbols to D, and Iv> is a mapping from n-tuples of elements of D into D such that the 
following relations hold: 

L(a) = Lj(a) for symbols a 
L(<uj u 2 ... u n >) = L 2 (<L(u 1 ) L(u 2 ) ... L(u n )>) 

Note that the above relations could be taken as a definition of L in terms of and L 2 . The 
condition that no expression is simpler than one of its subexpressions is equivalent to the following condition 
on Lj. 

L 2 (<L(u 1 ) L(u 2 ) ... L(u n )>) > max(L(Uj) L(u 2 )... L(u n )) 

Unfortunately the above conditions on well behaved simplicity orders are not quite strong enough to 
ensure that every expression has a simplification. There arc now two ways of ensuring this. The first is to 
require that the simplicity order be well founded. The second is to require that the congruence relation be 
~q for some consistent Boolean formula Q. The next theorem establishes that the well foundedness 
condition is sufficient for arbitrary congruence relations. 


Theorem 4.1: For any well founded well behaved simplicity order and any congruence 
relation every expression has a simplification. 
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Proof: An expression w will be called "cleaner” than an expression u just in case the pair<L(w) w> 
is less than the pair <l.(u) u> under a lexicographical ordering where the second components of the 
pairs are compared under the subexpression order. In other words w is cleaner than u just in case 
I.fw) is less than L(u) or L(w) equals L(u) but w is proper a subexpression of u. Since well behaved 
simplicity orders are well founded the cleanliness ordering is also well founded, i.e. there are no 
infinite chains of ever cleaner expressions. 

'Fhe proof that a simplification always exists will be done by induction on cleanliness. For an 
arbitrary expression u it is assumed that all expressions cleaner than u have simplifications. It is 
sufficient to show that this implies that u has a simplification. 

If there is an expression w in |u|^ which is simpler than u then w is cleaner than u and 
therefore has a simplification which must also be a simplification of u. On the other hand suppose 
that there is no expression in lul^ which is simpler than u (i.e. u is a simplicity bound). Because no 
expression can be simpler than one of its subexpressions if v is a proper subexpression of u then 
L(v)<L(u) and therefore v must be cleaner than u. Thus by the induction hypothesis all 
subexpressions of u have simplifications. Let w> be the result of replacing all of the top level 
subexpressions of u with simplifications of those expressions. By substitutivity w is in |u|~ and all 
proper subexpressions of w are simplicity bounds. To show that w is a simplification of u it remains 
only to show that w is a simplicity bound for However since u is a simplicity bound and since 
the monotonicity condition implies that L(w)<L(u), w must also be a simplicity bound. 

For a congruence closure ~q of a consistent Boolean formulae Q the well foundedness condition 
can be removed from the above theorem. The first step in establishing this result is to show that for any 
consistent Boolean formula Q and expression u there is a grammar G which captures all of the information in 
Q relevant to simplifying u. The precise conditions placed on the grammar G is defined below in terms of u 
and ~q. 


Definition: Let G be any normal grammar with maximal generator function T. The 
grammar G will be said to cover an expression v just in case T(v) is a non-terminal symbol. 
The grammar G will be said to approximate a congruence relation ~ just in case |v|—. equals 
the language generated by T(v) for all expressions v which are covered by G. 


lemma 4.2: For any .consistent equational Boolean formula Q and any expression u there is a 
normal grammar G which approximates ~q and which covers u. 

Proof: It was demonstrated in the previous section that if Q is consistent then there exists a finite 
set {2j ^2 ... 2 n ) of finite sets of equations such that the relation ~q equals the intersection of 
the relations For each 2- there exists a normal grammar Gj with maximal generator function 

T: such that ~ r equals . It can be assumed without loss of generality that each G; covers u. 
1 1 i n J 
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Now using the technique developed in the proof of lemma 3.4 one can construct the grammar 
/""'N GjflG 2 ...nG n such that the language generated by each non-terminal XjnX 2 ...flX n of 

G 1 nG 2 ...nG n is precisely the intersection of the languages generated by each Xj. From the 
definition of this grammar given in the proof of lemma 3.4 it is easy to show that this grammar is 
normal. To show that this grammar approximates 25 q let v be any expression covered by 
GinG 2 ...nG n . The non-terminal of GjflG 2 ...nG n which generates v must be 
r 1 (v)nr 2 (v)...nr (v) and the language generated by this is precisely the intersection of the sets 
|v|^ which is precisely |v|q. Finally since u is covered by each G^ the nonterminal 

T 1 (u)nr 2 (u)...nr (u) generates u and thus u is covered by G 2 flG 2 ...nG n . 


It is interesting to note that a normal grammar G can approximate a congruence relation ~ even if 
25 is not finitely equational. In particular the relation ~p, where T is the maximal generator function of G, 
need not be the same as 25 q; if w is not covered by G then jwl^ may be a proper subset of Iwl^. 


The following lemma tightens the relationship between a congruence relation 25 and a grammar G 
which approximates it. 


lemma 4.3: If a normal grammar G approximates a congruence relation 25 and u is an 
expression covered by G dien the set of simplifications of u under 25 equals the set of 
/'■“'N simplifications of u under 25p where f is the maximal generator function of G. 


Proof. First note that an expression covered by G is a simplicity bound under 25 just in case it is a 
simplicity bound under 25p. This is because if v is an expression covered by G then Ivl^ equals 
ivlj-w^. Second note that if v is an expression covered by G then since G is normal (and therefore 


one level) each subexpression of v must be generated by some non-terminal symbol of G. 
Therefore each subexpression of an expression covered by G is covered by G. The above two 
observations imply that an expression covered by G is simplified under 25 just in case it is 
simplified under 25p. Finally note that every expression in lul^ is covered by G since |u|~ is 
precisely the set generated by the non-terminal symbol T(u). Thus the set of simplifications of u 
under 25 consists of those expressions in jul^ which are simplified under 25p, i.e. the set of 


simplifications of u under 25p. 


The following theorem finally establishes the desired result concerning the existence of 
simplifications under consistent Boolean formulae. 

Theorem 4.4: For any consistent Boolean fonnula Q, any well behaved simplicity order, and 
any expression u, u has a simplification under 25 n . 

Given lemmas 4.2 and 4.3 the above theorem follows from the following lemma: 
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Lemma 4.5\ Let G be any normal grammar with maximal generator function P. For any 

well behaved simplicity order every expression covered by G has a simplification under ~p. 

The above lemma will be shown by constructing an algorithm which takes a normal grammar and 
assigns each non-terminal symbol X an expression s(X) which is simplified under and which is generated 
by X. 'fhen for any expression u covered by G, s(r(u)) is a simplification of u. 

The procedure for constructing the assignment s is presented below. At each point in the 
computation s is defined on a subset of non-terminal symbols. Such a partial assignment s can be extended 
to expression forms of the fonn <a^ otj ... a n > by setting s(<a^ ... a n >) equal to <s(aj)s(a 2 )... s(a n )> 

whenever sfa^) is defined for each otj. 

At each point in the computation there is also a queue of pairs <X, u> where X is a non-terminal 
symbol and u is an expression over terminal symbols. Each pair <X, u> on this queue should be drought of as 
a constraint on the value of s(X) which states that L(s(X)) must be less than or equal to L(u). This queue is 
always sorted with respect to the terminal expressions such that for any two pairs <X, u> and <Y, v> on the 
queue if u is simpler than v dren the pair <X, u> appears earlier on die queue than the pair <Y, v>. Thus if 
<X, u> is die pair at the head of the queue then u must be at least as simple as any other terminal expression 
appearing in any other pair on the queue. The procedure maintains the following three invariants: 

1) For each non-terminal X such that s(X) is defined, X generates s(X) under G. 

2) For each pair <X, u> on the queue X generates u under G. 

3) Let <X, u> be die first pair on the queue. For any non-terminal symbol Y if s(Y) is 

defined dien L(s(Y))<L(u). 

The procedure is given below, initially s is totally undefined. 

1) For each production of the form X=>a where a is a terminal symbol place the pair <X, a> 

on the queue. 

2) Take the first pair <X, u> off the queue. If s(X) is undefined do the following: 

i) Set s(X) to u. 

ii) For each production of .the form Y=>a such that X appears in a and s(a) is 
defined, place the pair <Y, s(a)> on the queue. 

3) If the queue is not empty go to 2) 

The algoridim terminates because there arc only finitely many non-terminal symbols which can be 
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assigned simplifications. The first two invariants are that if s(X) is defined then X generates s(X) and if <X, u> 
is a pair on the queue then X generates U. These two invariants are easily shown to be true in the initial state 
since the function s in initially totally undefined and every pair on the queue is of tine form <X, a> where a is a 
terminal symbol and X=>a is a production of G. Given that these invariants hold it can easily be shown that 
no step of the procedure causes cither invariant to be violated. 

The final invariant is that for any non-terminal Y, if s(Y) is defined and <X, u> is the first pair on the 
queue then L(s(Y))<L(u). Because the queue is sorted if <X, u> is the first pair on the queue then L(u) is the 
least simplicity level assigned any terminal expression on the queue. Thus the final invariant is equivalent to 
the statement that whenever s(Y) is defined, s(Y) is at least as simple as any terminal expression on the queue. 
This invariant holds trivially in tine initial condition since the function s is totally undefined. Furthermore 
simply removing a pair from the queue can never cause this invariant to be violated. The only times the 
invariant might be violated is when s(X) is defined for some X or when a pair <Z, w> is added to the queue. 

Consider an execution of step 2) of the procedure in which a pair <X, u> is removed from the queue 
and s(X) gets set to u. Let <Y, v> be tine pair which is next on the queue when the pair <X, u> is removed. 
Given that the invariant is already in force L(s(Z)) must be less than or equal to L(u) for any non-terminal 
symbol Z such that s(Z) was defined. Furthermore L(u) is less than or equal to L(v) and thus L(s(Z)) is less 
than or equal to L(v) for any non-terminal symbol Z such drat s(Z) was defined. Further since L(u) is less 
dian or equal to L(v), L(s(X)) will be less than or equal to L(v) when s(X) is set to u. Thus the removal of 
<X, u> from the queue and the setting of s(X) to u does not violate the invariant. 

Now consider the queueing of any pair of the form <Z, s(«)>. Let <Y, v> be the first pair on the 
queue before the pair <Z, s(a)> is added. For every non-terminal symbol W such that s(W) is defined 
L(s(W))<L(v). Given this fact the monotonicity condition on the simplicity order can be used to show (via a 
simple induction on expression forms) that for any expression form a such that s(a) is defined L(s(a))<L(v). 
Thus when the pair <Z, s(a)/ is queued L(s(a)) is less than or equal to L(v) so the simplicity level of the first 
pair on the queue remains the same and die invariant is not violated. 

It will now be shown that if X is any coherent non-terminal (i.e. the language generated by X is not 
empty) then s(X) gets defined by the above procedure and that s(X) is a simplicity bound under ~p. It is 
sufficient to show by induction on expressions that if v is any expression covered by G then s(T(v)) gets 
defined and L(s(F(v)))<L(v). If v is a terminal symbol covered by G dicn the pair <T(v), v> will be placed on 
die queue in step 1) of the procedure. This ensures that s(T(v)) will become defined and that 
L(s(T(v)))<L(v). Now suppose v is an expression covered by G which is of the form <v^ v 2 ... v n >. Since v is 
covered by G each Vj must also be covered by G and die production r(v)=><r(vj) r(\ ; 2 ) ... T(v n )> must be a 
production of G. Since each Vj is covered by G the induction hypodicsis implies that s(F(vj)) gets defined and 
that L(s(r(vj)))<L(Vj). Thus the monotonicity condition on die simplicity order implies that: 

L«s(r(v 1 )) s(r(v 2 ))... s(r{v n ))»<L«v 1 v 2 ... v n ». 


Now since die production r(v)=s><r(v^) T(v 2 )... T(v n )> is in G and since s(f(V|)) gets defined for 
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each vj the pair <r(v)<s(F(vj)) s(r(v 2 ))... s(r(v n ))>)> gets placed on the queue. Therefore s(F(v» gets 
defined such that: 

L(s(r(v))) < L(<s(r( Vl )) s(r(v 2 ))... s(r(v n ))» 

Combining the above two relations we get: 

L(s(r(v)) < Uv) 

It has been shown that for each coherent non-terminal symbol X that s(X) gets defined and that s(X) 
is a simplicity bound under Note that given the way pairs are placed on the queue and the way s(X) gets 
assigned, if s(X) is not a terminal symbol then it can be written as <s(Yj) s(Y 2 )... s(Y n )> where each Yj is a 
non-terminal symbol. Thus every subexpression of s(X) is also a simplicity bound and therefore s(X) is 
simplified under This implies that for any expression v covered by G, s(T(v)) is a simplification of v. 

In analyzing the running time of this algorithm let |G| be the total size of the grammar G. For each 
pair placed on the queue there is a particular production X=>a of G such that u is s(a). A given production 
can be responsible for at most the queueing of one pair so that the number of pairs placed on the queue is 
order |G|. The time required to queue and remove order |G| elements from a sorted queue (a priority queue) 
is order |G|log(|G|) plus the time it takes to compute order |G|log(|G|) comparisons. A given production 
X=><X 2 X 2 ... X n > can be examined by part ii) of step 2) of the algorithm at most n times. Thus the total 
time spent in part ii) of step 2 (ignoring queueing time) is order |G|. Thus the total time taken by the above 
procedure is order |G|log(|G|) plus the time it takes to perform order |G|log(|G|) comparisons between 
expressions. 

The next theorem provides a representation for the set of simplifications of an expression under the 
congruence relation imposed by a normal grammar G. It turns out that the set of simplifications can be 
described by a grammar which is a subset of the productions of G. 

Lemma 4.6: For any well behaved simplicity order and normal grammar G with maximal 
generator function T there is a subset G' of the productions of G such that for any expression 
w the set of simplifications of w under is the language generated by F(w) under G\ 

Proof: Without loss of generality it may be assumed that every non-terminal of G is coherent. 

Thus for each non-terminal symbol X of G, L(X) can be defined to be the simplification level of 
any simplicity bound generated by X, i.e. L(X) is the minimum simplicity level assigned any 
/'“N expression generated by X. The totalizer L can be represented as a triple <D Lj L 2 > as described 

above. Given such a representation L(a) can be defined for an arbitrary expression fonn a via the 
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following relation: 

LKaj «2 - a n > ) = I-2(<L(a j) Ua 2 )... L(a n )>) 

The monotonicity condition on the simplicity order ensures that in general I.(a) is the 
minimum level assigned any expression generated by an expression form a. 

Let G’ be that subset of the productions X=>a of G such that L(a) equals L(X). Now it will be 
shown by induction on expressions that for any expression v covered by G, if T(v) generates v 
under G’ then L(v) equals L(r(v)). For any expression v covered by G, T(v) equals some 
non-terminal symbol X of G. If v is a constant symbol covered by G then if X generates v then the 
production X=>v is in G’ and by the definition of G’ this implies L(v) equals L(X). If v is of the 
form <vj v 2 ... v n > and X generates v under G’ then there must be a production X=><X 2 X 2 ... X n > 
of G’ such that Xj=>*Vj for each i and thus by the definition of G’ the following relations must 
hold: 


L(X) = U<X 1 X 2 ...X n >) 

= L 2 «L(X 1 )L(X 2 )...UX n )» 
and by the induction hypothesis: 

= L 2 «L(v 1 )L(v 2 )...L(v n )» 

= L(v) 

It will now be shown that for any expression form a, if a=>*v under G’ then L(v) equals L(a). 
This is done by induction on expression forms. If a is a terminal symbol then if a=>*v under G’ 
then a must equal v and the result is trivial. If a is a non-terminal symbol and a=>*\ under G’ 

then T(v) must equal a and the result follows from the above induction. If a is of the form 

<a| <* 2 ... a n > then v must of the form <v 2 v 2 ...v n > where aj=>*Vj for each aj. In this case the 
result follows from tire following relations: 

U» = L(<«| a 2 ... « n >) 

= L 2 «L( ai )L(a 2 )...L(a n )» 
and by the induction hypothesis: 


= L 2 «L(v 1 )L(v 2 )...L(v n )>) 


= L(v) 
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It follows from the above induction that if F(v)=>*v under G’ then L(v) is the least level 
/ m \ assigned any expression generated by r(v) under G and therefore v is a simplicity bound under 

~p. It will now be show that if r(v)=s>*v under G’ then v is simplified under ~p. Form the 
above observations it is sufficient to show that if r(v)=>*v under G’ and if v is of the form 
<v^v 2 ... v n > then for each vj, r(vj)=>*Vj under G\ The expression form F(v) is either a 
non-terminal symbol or tine n-tuple <X(vj) r(v 2 )... T(v n )>. In tine latter case r(vj)=>*v^ for each 
Vj. In the former case since G’ is a one level grammar each Vj must be generated under G’ by some 
non-terminal symbol of G, but the only non-terminal which could possibly generates v- under G ! is 
T(vj) and therefore r(vj)=>*vj for each vj. 

For any maximal expression fonn a the language generated by a under G is an equivalence 
class of ~p. Furthermore it has been shown that if a=>*v under G’ then v is simplified under 
~p. Thus for any expression u the language generated by T(u) under G’ is a subset of the 
simplifications of u. It now remains only to show that every simplification of u is generated by T(u) 
under G\ 

It will be shown by induction on expressions that if v is an expression which is simplified under 

~p then T(v)=>*v under G’. If v is a constant symbol which is simplified under ~p then either 

T(v) is v in which case the result is trivial or T(v) is a non-terminal symbol such that the production 

r(v)=>v is in G. But since v is simplified under ~p L(v) must equal L(T(v)) and thus the 

production T(v)=>v is in G\ Now let v be a simplified expression of the form <vi v 2 ... v n >. since 

1 z. II 

v is simplified so is each Vj and therefore by the induction hypothesis for each vj, r(vj)=>*vj under 
G’. Furthermore T(v) is either <r(vj) T(v 2 )... r(v n )> or a non-terminal symbol X such that the 
production X=><T(v 2 ) r(v 2 )... T(v n )> is in G. Thus it is sufficient to prove that in the case where 
r(v) is a non-terminal symbol X the production X=i><r(vj) T(v 2 )... r(v n )> is in G’, or equivalently 
to prove that L(<r(vp) T(v 2 )... r(v n )>) equals L(X). However since v is simplified L(X) equals 
L(v) and furthermore since each Vj is simplified L(T(Vj)) equals L(vj). This implies the following 
relations which establish the desired result: 

L«r( Vl ) r(v 2 )... r(v n )>) 

= L 2 «L(r( Vl )) ur(v 2 ))... L(r(v n »» 

= L 2 (<L(v 1 )L(v 2 )...L(v n )» 

= Uv) 

= UX) 

It is important to note that the subset G’ of the productions of G can be computed in a 
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straightforward way given an assignment s of simplified expressions to non-terminal symbols such as the one 
produced in the procedure described earlier. Specifically a production X=>a is in G’ just in case s(X) is not 
simpler titan s(a). 

The following theorem has now been established: 

Theorem 4.7: For any consistent equational Boolean formula Q, any expression u, and any 
well behaved simplification order the set of simplifications of u under ~q is a context free 
expression language. 
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5. CONCLUSIONS AND OPEN PROBLEMS 


The notion of an expression is more general than the notion of first order term. Many different 
logical formalisms consist at least in part of expressions in which the substitution of equals for equals is valid. 
Thus the concepts and algorithms discussed here can be used for valid inference in a wide variety of deductive 
systems. 

The relationship which has been established between finite sets of equations and context free 
expression grammars may have applications beyond tire deduction techniques and simplification algorithms 
discussed here. For example consider unification under a finitely equational congruence relation. ■ More 
precisely let u and v be two expressions containing variables and let 2 be a finite set of equations between 
ground expressions. For any substitution a and expression u let o(u) be the result of simultaneously 
replacing each variable of u with its image under a. The unification problem for tire finite set of equations 2 
is to characterize the set of ground substitutions a such that a(u)~ 2 a ( v )- An obvious first step in 
approaching this problem is to consider a grammar G which represents 2 and consider substitutions over the 
maximal expression forms of G (remember that each maximal expression form of G represents an 
equivalence class under 2). The details of a unification algorithm have not been worked out and it is not dear 
what the complexity of this problem is. Another extension would be to develop a unification algorithm for 
equational Boolean formulas (remember that for a Boolean formula Q, ~q need not be finitely equational). 

It is hoped that the material presented here will be applicable to various domains including program 
verification, compiler optimization techniques, and deductive data bases. 
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